Optimal phase measurements with pure Gaussian states 
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We analyze the Heisenberg limit on phase estimation for Gaussian states. In the analysis, no 
reference to a phase operator is made. We prove that the squeezed vacuum state is the most 
sensitive for a given average photon number. We provide two adaptive local measurement schemes 
that attain the Heisenberg limit asymptotically. One of them is described by a positive operator- 
valued measure and its efficiency is exhaustively explored. We also study Gaussian measurement 
schemes based on phase quadrature measurements. We show that homodyne tomography of the 
appropriate quadrature attains the Heisenberg limit for large samples. This proves that this limit 
can be attained with local projective Von Neuman measurements. 



I. INTRODUCTION 



For a long time quantum phase measurements have 
been a field of attention for both theoretical and experi- 
mental research in q uantum th eory 
E M, IH IH IH El El in in D3 Many schemes have 
been proposed with the aim of optimally determining the 
phase shift produced by a physical process 01 > usually 
focusing on quantized harmonic oscillators. Despite all 
the efforts, no attempt has ever settled the issue with full 
generality Many different approaches exist but they 
often have a restricted range of applicability. There are 
various reasons for this situation. The most important 
one is the fact that a well defined phase operator does 
not exist on the whole Fock space. This seems to indi- 
cate that Heisenberg's uncertainty relation for phase and 
number may not be always taken at "face value" and a 
rederivation without the use of such a phase operator is 
called for (see ^3 and references therein) . 

In this paper we are concerned with phase measure- 
ments on pure Gaussian states. First, we review the 
derivation of the Heisenb erg limit using the theory of 
quantum state estimation 0,H(|, which does not require 
the existence of a phase operator. Then we propose a pos- 
itive operator-valued measure (POVM) that attains the 
limit by an adaptive procedure [U |23, H3| , for asymp- 
totically large number of copies. We also determine the 
optimal Gaussian measurement for phase estimation in 
this asymptotic regime. 

When performing phase measurements on Gaussian 
states, one usually implements them on light beams with 
a given average photon number, i.e., with a fixed energy. 
This constraint is always assumed when comparing differ- 
ent Gaussian states in order to determine their optimal- 
ity for phase estimation. The techniques we use, namely, 
asymptotics in quantum statistics, are best suited for the 
problem at hand, where one usually has a light beam 
turned on for a large number of coherence times, which 
translates into having a large number of identical copies 
of such Gaussian state. 



II. QUANTUM PARAMETER ESTIMATION 

In this section we review the theory of quantum pa- 
rameter estimation |6|, |2fj, |2jj, |2J, l25l |26j , which leads 
to a useful generalization of the Heisenberg uncertainty 
relation. 

Let us assume we have a family of pure states p{9) 
labelled by a parameter 9. In our case the parameter is 
the phase. This family corresponds to a curve in Hilbert 
space, and translations along such curve are generated 
by a hermitian operator G in the usual manner. In our 
case the generator is the number operator G = a^a: 



piO) = cxp(-i9G)p(0)ex.p(i6G). 



(1) 



Throughout this paper we use the following notation. 
We will write \n) to denote Fock states, i.e., states with 
well defined number. With Uq we denote the unitary op- 
erator that yields the state p{9) from the vacuum, namely 



p(6)=U e \0)(0\ul 



(2) 



We define the vectors 
by Kg, 



b n ) as the Fock states transformed 



= U e \n) 



(3) 



hence, p{6) = |0o)(M 

Our aim is to determine the parameter 9 by perform- 
ing the best possible POVM measurement on the sys- 
tem. For this, we propose an estimator 9(x), which is a 
function of the possible outcomes x of the measurement. 
Hence, the expectation value of such estimator is [27J 



E [6} = J2p(x\6)Hx), 



(4) 



and its variance 

Var e [0] = ^2p(x\e)(0(x) - E e [<9]) 2 , (5) 

X 

where the subscript 9 means that the expectation value is 
taken with probability distribution p(x\9). In the quan- 
tum case, they are given by the Born rule p(x\9) = 
tr[p(9)E x ], where {E x } are the elements of a POVM. 
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An estimator is called unbiased when 
E e [9] = e. 



(6) 



In this case, the variance is equivalent to the mean 
squared error (MSE): 

MSE[<9] ee E[(0 - 9f] = Var[0] + (E[0] - 9) 2 . (7) 

A well known theorem in statistics gives a lower bound to 
the variance of any unbiased estimator. It is the so-called 
Cramer-Rao bound, (23 



Var e [0] > 



1 



F{9\ 



(8) 



where F(9) is the Fisher information associated with the 
measurement, 



F{6) = P( X W 



i£X 4 



d\ogp(x\> 
89 



(9) 



and the sum runs over the set of possible outcomes X + , 
i.e. those with p(x\9) ^ 0. 

Furthermore, the Braunstein-Caves inequality [2(| sets 
an upper bound on the Fisher information: 



F(9) < H(8), 



(10) 



where H(9) does not depend on the specific measure- 
ment being performed. H{9) is sometimes regarded as 
the Quantum Fisher Information (QFI). It is defined as 



H{9) = tv[p(9)\{9f 



(11) 



where A is the symmetric logarithmic derivative (SLD), 
defined as the hcrmitian operator that fulfills 



d P {9) 
89 



-i[G,p(6)) 



-[X(9)p(9) + p(8)X(9)j 



(12) 



Eqs. (JHl and i|l(J|) set a fundamental bound on the vari- 
ance of any unbiased estimator. This bound only depends 
on the geometrical properties of the curve p{0). In our 
case, the SLD is 

\{0) = 2i - \f){<h\) = -2i[G,p(9)}, (13) 

where 

|0> = (l-|0o)(0o|)G|0o>. (14) 



The eigenvectors of A are = \ip)±iy/ \4>o), with 
the corresponding eigenvalues given by l± = ±2-^/ (ip\ip). 
Note that (f^l^o) = 0. Working out the value of H wc 
get 



H(9)=4(^)=4(AG)l 



(15) 



where (AG)g is defined, as usual, as (</>o|G 2 |0o) 
(</>o|G|0o> 2 . Thus, 



Y & Te[0]{AG) 2 e > ~. 



(16) 



This expression is similar to Heisenberg's uncertainty re- 
lation for canonically conjugate variables, but has some 
advantages (see ^^)- First of all, it has been derived 
without the use of any phase operator. In fact, the only 
operator we need is the phase shift generator, i.e., the 
number operator. On the other hand, it sets a lower 
bound on the variance of an estimator, whereas the stan- 
dard uncertainty relation does not concern optimality but 
only variances obtained from measurements of observ- 
ables (i.e., self-adjoint operators). In this sense, Eq. i|TT)|) 
is more general. 

Now assume we have N identical copies of the same 
unknown state p(9). In this case the collective state 
p(9)® N is still a member of a one-parameter family, and 
it is straightforward to show that the corresponding QFI 
scales as N. Thus combining Eqs. JSJ and l|10|) . with H 
replaced by the iV-copy QFI and 9 denoting any unbi- 
ased estimator based on any measurement (collective or 
individual) on the N copies, we have 



Vax N , e [0](AG) 2 e > ^ 



(17) 



where the subscript N stands for the number copies. 

An important issue is the attainability of these bounds. 
The Braunstein-Caves inequality is known to be sat- 
urable Hfllll, i-e., there exists a POVM that gives the 
equality in (|10|) [it is given by A(#), see Sec. IIVI for de- 
tails]. Note, however, that in general X(0) is not constant 
and the optimal POVM depends on the true value of 9. 
This suggests that, in order to attain the equality in i|l(J|) 
for large N, one needs an adaptive scheme. The mea- 
surement at a given step of the estimation scheme may 
need to be optimized using the data gathered from the 
previous steps. The Cramer-Rao bound is also known to 
be asymptotically saturable by the so-called maximum 
likelihood estimator (MLE). That is, for multiple iden- 
tical measurements, the MLE has a variance which ap- 
proaches the inverse of the Fisher information, thus at- 
taining equality also in JSJ. All this seems to indicate 
that the optimal scheme should consist of at least two 
steps: A preliminary rough estimate of 9, using a van- 
ishing fraction of copies (N a , with < a < 1), and 
a refinement using the remaining N = N — N a copies, 
where all measurements are identical in the last step. 

Therefore, if N — > oo one can reasonably expect that 



/V 



lim NVaXjf 6 [6] 



A(AG)\ 



(18) 



for the MLE, provided the optimal POVM measurement 
is performed in step two. Here the subscript N indicates 
that the variance is obtained from the outcomes of the 
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last TV identical measurements. This will be loosely writ- 
ten as 

1 



Var e [0] 



(19) 



4<AG) 2 ' 

We show below that l|19|) holds if the number of copies 
used in each of the two steps of the estimation scheme 
(i.e., a) is appropriately chosen. 

A word should be said about asymptotically unbiased 
estimators. These are estimators whose expectation val- 
ues converge to the parameter in the large sample limit, 
but have some vanishing bias, i.e., 



E[0] = + 0(N~ 



(20) 



with a > 0. In this case the correct quantifier of the sen- 
sitivity is the MSE. If a > 1/2 the MSE is asymptotically 
equivalent to the variance, since 



MSE[0] = Var[0] + 0(N~ 2a ), 



(21) 



and we note that the leading order contribution in inverse 
powers of N is solely given by Var[0]. 

III. OPTIMAL GAUSSIAN STATE 

Next we determine the optimal Gaussian state, which 
provides the highest sensitivity to phase measurements 
within our approach, i.e., the one that maximizes its QFI. 
The state is obtained by sequentially applying a series of 
operations on the vacuum: (i) a squeezing along a fixed 
direction (say Q), S(r) = exp[(r/2)(a 2 — a^ 2 )]; (it) a dis- 
placement, D(a) = exp (chcl* — a* a) with real a; (Hi) the 
unknown phase shift, U(6) = exp (— iOaJa). The state 
after applying these operations is 



\<h)=U(fi)D(a)S(r)\0), 
and its QFI is given by 
H(9) = 4(Aa^a) 2 g 



(22) 



(23) 



\a\ 2 (coshr — sinhr) 



2 sinh 2 r cosh 2 r 



Notice that actually H does not depend on the phase. 
The energy of the state l|22|) is 



(n)g = (cr a)e = |ct| 2 + sinh 2 r. 



(24) 



We aim at maximizing H with the constraint to have a 
fixed average number of photons. Using Lagrange multi- 
pliers it is straightforward to find that the most sensitive 
choice is a — 0. As intuition dictates, the highest sen- 
sitivity is achieved by employing all available energy in 
squeezing the vacuum. Therefore, the optimal gaussian 
state is |0 O > = U(9)S(r)\0), for which 



H = cosh4r - 1 = 8((r 



<n». 



(25) 



Hence, optimal phase estimation has a variance that goes 
as 

1 



Var[0] 



8«r 



(26) 



IV. ASYMPTOTICALLY OPTIMAL 
FEW-OUTCOME MEASUREMENT 

Following the derivation of the Braunstein-Caves in- 
equality |2(| . one can check that the optimality condi- 
tions on the POVM elements {E x } are (see pffi 



Im{tr[ j o(0)£; x A(0)]} = 0, 

^^(0)A(0)V^ = k*yfm« 



(27) 



E, r 



for all x that have non- vanishing probabilities, where k x 
are some constants. Assuming pure states p 2 — p, and 
the fact that any POVM can be decomposed into rank 
one operators (so that y/E x oc E x ), one may write the 
second optimality condition as 



p(9)X(d)E x = k xP (d)E x . 



(28) 



Finally, one can check that a sufficient condition for 
Eqs. and to hold is that {E x } project onto the 
eigenspaces of the SLD. Therefore, an optimal POVM 



can be chosen 
given by 



to have three elements, {E + , E-, Eq}, 



E± - 



2(V#> 



and 



E = l- E+- E-. 



(29) 



(30) 



For completeness we give the explicit form of the 



states 



They are 



1^) = V2 sinhr cosh r(±i\4> ) - \ fa)) , (31) 
which, after normalizing become 

|0±) ±# )-|</>2) „ ±t|0)-|2) 



V2 



V2 



(32) 



This measurement resembles tomography applied to es- 
timation of spin states lying close to the z axis of the 
Bloch sphere, where spin measurements along the x and 
y directions are optimal 0, |2^ • 

From the hermiticity of A it is straightforward to check 
that {E + , E-, Eq} are positive and, furthermore, that 
they represent projective von Ncuman measurements. 
The outcomes of these measurements will be used in the 
maximum likelihood analysis of the next section, where 
its corresponding expectation value and MSE will be dis- 
cussed. 



V. MAXIMUM LIKELIHOOD ESTIMATION 

Assume one has performed step one, a series of non- 
optimal measurements on a vanishingly small number of 
copies, N a , of our optimal Gaussian state \(j>o), and has 
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obtained a rough guess 9 of its phase. Let us write 
9 = 0—59, where 59 represents the error in this first step, 
which is assumed to be small. Let |0 O ) be the guessed 
state. We design (step two) the optimal measurement on 
the remaining N = N — N a copies as if 9q were the true 
phase. 

Let N± o be the number of times the outcomes ±, are 
obtained. The likelihood function for 9 is H3 



C(9) = 



N) 



p(+\o) n+ p( 



N + \NJ.N \ 



KOlfl)^ . (33) 



Numerical maximization of this function yields the 
strict MLE. Recall that this is known to attain the 
Cramer- Rao bound in the limit of large number of copies. 
We can, however, analytically compute an approximate 
estimator, which converges to the MLE in the large 
sample limit and has the same nice properties, i.e., it 
is asymptotically unbiased and attains the Cramer-Rao 
bound asymptotically These properties will be checked 
explicitly. 

Taylor expanding the true state around the first-step 
guess we have 

p(9)^p(9 ) + ^[x(9 Q )p(9 )+p(9 )X(9 )]59 + O(69 2 ),(M) 

and the probabilities of obtaining the outcomes x = ±, 
are 



fixed value of iV; n f we have 



MLE 



JVi„f 

E< 

N + =0 



2q- 1 



u 2(An)' 

where q is, to the relevant order, q = 1/2 + (An) 
0(59 2 ). This yields 



(39) 



E 



6,Nj 



J: 



MLE. 



= 9 + 0(69 2 ). 



(40) 



Since 59 2 = 0(N~ a ), the expectation value is asymptoti- 
cally unbiased [recall Eq. (|2(Jfl ]. It is also straightforward 
to compute the MSE of this estimator, for a given number 
Ni n f of informative outcomes : 



MLEJ 



^-L_ +0 (f ) +0(i n (4i) 



for N- m { > 0, and 



MSE e , o [0ML E ] = 59 2 , 



(42) 



for N int = 0. 

To compute the full MSE of our scheme we just need 
to average over all possible values of iVj n f. This yields 
(see Appendix El for details) 



p(±\9) = ±±J(M)59 + 0(59 2 ), (35) 



p(0\9) = 0(59 2 ), 



(36) 



where |$) is given by Eq. I|14|) by replacing \<p ) for |^> ) 
With this, = (An) 2 . Thus, 



<91og£ 



E 



N x 8p(x\9) 



89 ^ , p(x\9) 89 



(37) 



4(Ar 



AT, - N_ 



- {N+ + N-)(An}56 



0{59 2 ). 



Equating this to zero yields the approximate MLE at 
leading order in 59: 



'MLE 



2N iaf (An) 



(38) 



Here Ni n f = N+ + iV_ is the total number of "informa- 
tive" outcomes. This estimator is based on disregard- 
ing the improbable and noninformative outcomes x = 0. 
Our approximate MLE is undefined in the rare event that 
Mnf = 0. In this case we may just keep the preliminary 
estimate and define 6>mle = #o- 

Disregarding the unlikely x = outcomes, the estima- 
tor is just based on a binomial distribution, and for a 



MSE e [0 M L E ] = M SE0,jv inf [fW] (43) 

iV inf =0 



N 



1 

N 



N int 



4(An> 



(1-P) 



0{59 2 



N-Nlnt 



(44) 



where p is the probability of getting an informative out- 
come in a given measurement [p = 1 — 0(59 2 )]. Clearly 
enough, by choosing a > 1/2 the error of the first step 
only contributes to subleading orders in the variance. 
If, e.g., a = 2/3 one sees that in the large N limit, 
the MSE[9] reduces to 



MSE0 [9 



1 



MLE = 



8((r 



(n» N 



+ o 



AT4/3 



(45) 



Hence, the optimal performance displayed in l|26|) is 
asymptotically attained. 



VI. ASYMPTOTICALLY OPTIMAL GAUSSIAN 
MEASUREMENTS 

Now we face the problem of determining the mini- 
mal phase variance that can be attained by means of 
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dyne measurements. These consist of the simultane- 
ous measurement of two conjugate variables, such as Q 
and P, or their phase generalizations U(9')QU^(9') and 
U(0')PW{0'). They are typically performed by split- 
ting the signal state. Often this is done by means of 
a beam splitter, the so-called eight-port homodyne de- 
tector [2||. Another possibility is heterodyne detection. 
In any case, the Arthurs & Kelly theorem (see, e.g., |2jJ) 
guarantees that further noise will appear in the detection 
process. In the eight-port homodyne detector it appears 
as vacuum fluctuations entering through the unused port 
of the beamsplitter, which is sometimes called auxiliary 
port. By introducing some sort of squeezed vacuum in the 
auxiliary port, one can reduce the noise in one quadra- 
ture, at the expense of increasing the noise in the cor- 
responding conjugate quadrature. These measurements 
can be mathematically expressed as a covariant measure- 
ment 0, Q with POVM elements given by 



From H47|) the probability density is 



(46) 



where (q,p) = x are the outcomes and represent coor- 
dinates in phase space, and D(x) is the displacement 
operator D(x) = expz(pQ — qP). The density matrix 
do is the ancilla entering through the auxiliary port of 
the beamsplitter. From the covariance of the measure- 
ment, o~o can be taken to be a squeezed vacuum state 
((<?) = (p) = 0) without loss of generality. One can relate 
the probability of obtaining the outcome Xi dp(xW)i with 
the fidelity H3, T(a x ,p{9)) = tr ^(f) 1 / 2 ^/)^) 1 / 2 , as 



dp( X \0) = tr[p(9)E(x)}d 2 x 

Z7T 



When do is a Gaussian state the measurement is said 
to be Gaussian and the outcomes are Gaussian dis- 
tributed. Moreover, when one of the two states is pure, 
the fidelity can be easily expressed as T[o Xl p{0)) 2 = 

2 v /detM(<9)e- x * M W* & where M{9) = ( 7o + le)' 1 ■ 
The covariance matrices of the input and the auxiliary 
states are 



with 



7e 
7o 



R(0) 



R\B) S R(9), 
J?*(0') T R(6') 



cos a — sin t 
sin 9 cos 9 



S = 









1/s 2 



T = 









1/t 2 



(48) 
(49) 



(50) 



(51) 



where s = e~ r and 9 (t = e~ r and 9') are the squeezing 
parameter and phase of the input state (ancilla). Re- 
call that the diagonal matrix elements of the covariance 
matrices give the variances of the canonical observables 
(AQ) 2 = [ 7e ] n /2 and (AP) 2 = [79)22/2. 



= V<ktM(fl) exp[ _ x t M{dU (52) 

The Fisher information of such measurement can be read- 
ily computed from @: 

F(9) = - ti[M'M- 1 M'M- 1 ], (53) 

where M' denotes the derivative of M(9) w.r.t. 9 and 
A/" 1 its inverse. The 9 dependency has been dropped 
for readability reasons. Further manipulations show that 
the Fisher information can be written as 

F(0) = ltr[r- 1 Sr- 1 S] ] (54) 



with 



and 



T = S + R\e' -9) T R{9' -9) 
= (cosh2r + cosh 2/) 1 

— (sinh 2r + sinh 2r' cos ip)<Ta 

+ sinh 2r' sin (/? <j\ (55) 



S = R\-k/2) S + S R(tt/2) = 2sinh2r cr u (56) 



where, as expected, <p = 2(9' — 9) is the only relevant 
angular parameter, and <r,j (i — 1,2,3) are the standard 
Pauli matrices. With this, F(9) reads 



(47) F(9) = 



2 sinh 2 2r 



(cosh 2r cosh 2r' — sinh 2r sinh 2r' cos <p + 1 ) 2 

x (cosh 2r cosh 2r' — sinh 2r sinh 2r' cos <p 

+ sinh 2 2r' sin 2 ip + 1). (57) 



Here the dependence of ip on 9 is implicit. To optimize 
the measurement, we find the values of 9' and r' that 
maximize the Fisher information. Imposing dF/dip = 
we find three possible extremal points: ip = and tp — 
±<po(s,t) [see Eq. (|dT7|) below for the expression of <po\. 
We will drop the arguments of ^0 where no confusion 
arises. The angle tp = is a maximum if the squeezing 
in the auxilliary port of the beamsplitter is weaker than 
a threshold, 1 < t < <thr(s), given by 



tthr(s) = 7^\A 4 -1 + Vs 8 + 14s 4 + 1. (58) 

In this case, this is the only extremal point of F(9). When 
t > t t hr the trivial solution tp = becomes a minimum 
and the solutions tp = ±tpo become maxima. 

The maximum Fisher information below the threshold 
(1 < t < t thl ) is 



HO') 



i 2 (l-s 4 ) 



4\2 



sinh 2 2r 



s 2 {s 2 + t 2 ) 2 cosh 2 (r-r')' 



(59) 
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As a function of t, the maximum of F(9) is at the critical 
point t = i t hr- 

Over the threshold (t > t t hr) the optimal angle is given 

by 



COS ifio 



2 cosh Ar' sinh 2r + cosh 2r' sinh Ar 



(3 + cosh Ar) sinh 2r' + 2 cosh 2r sinh Ar 



7, (60) 



tpa > [recall the definition of s and t right below 
Eq. (| 5 1 p ] . According to the definition of (p, we have 
9' = 9 ± ipo/2. The maximum Fisher information is 



F (9' T </V 2 ) = sinh 2 2r 

3 + cosh 4r + 8 cosh 2r cosh 2r' + 4 cosh 4r' 



(61) 



4(cosh2r + cosh 2r') 2 
As t — > oo (r' — > — oo), the optimal angle becomes 

$(s) = lim^o(M) (62) 

arccos(— tanh 2r), 



arccos 



1 



and the Fisher information approaches the value 



lim F (9' T <Po/2) 



(1 



D 4\2 



2s 4 



cosh4r — 1. 



(63) 



This is precisely the Heisenberg limit found in Sec. HP [see 
also Eqs. (|25J) and J2EJ]. Hence, optimal phase sensitivity 
is attained with a POVM measurement given by Eq. I|46|) 
and an auxiliary state <jq characterized by t — > oo and 
0' = 9 ± 3>/2. This is an infinitely squeezed state phase- 
shifted w.r.t. the true phase. In Sec. IVIIII we present a 
two-step adaptive implementation of this measurement 
which can be derived proceeding along the same lines as 
in Sec. [3 This scheme also attains the Heisenberg limit. 



VII. INTERPRETATION OF THE OPTIMAL 
GAUSSIAN MEASUREMENT 

In this section we provide an interpretation of this ap- 
parently unphysical proposal of a Gaussian measurement 
with infinite squeezing in the auxiliary port. Introducing 
an infinitely squeezed ancilla in this port is equivalent 
to having zero noise in one quadrature, say P' , at the 
expense of introducing infinite noise in the correspond- 
ingly conjugate one, Q' . This makes the readings of Q' to 
be just random noise with no information about the sig- 
nal state. Therefore, one has only a measurement of P 1 . 
This strongly supports the idea that, after all, only ho- 
modyne detection is needed to implement the measure- 
ment under discussion. Below we provide a mathemati- 
cal justification of this interpretation, and show that the 
marginal POVM elements, i.e., those obtained by inte- 
grating Eq. (|46|l over q', become exactly the rank-one 
projectors of the P' measurement. 



Let us start by giving the precise definition of the ob- 
servables Q' and P': 

Q' = U(9')Q C/ f (9') = Q cos 9' -P sin 9', (64) 
P' = U(9')P U\9') = Qsin0' + Pcos0'. (65) 

This is just a rotation in phase space, thus the coordi- 
nates transform in the same manner 

X = R(0')X, (66) 

and the displacement operators D{\) transform as 

D'( X ') = U(9')D(x!)tf(e') = D( X ). (67) 

Let \Q;q) and \Q';q) be the eigenstates of Q and Q' re- 
spectively, where q is the eigenvalue, so that \Q']q) — 
U(9')\Q;q). Since a is 

a Q = U(9')S(r')\0)(0\SHr')UH0'), (68) 
the POVM elements of Eq. (|46|l can be cast as 



E (X) = T^U(9')D( X ')S(r')\0)(0\S\r')D\ X ')U\9') 

12 \Q';qi){Q';q2\ 



dq\dq 2 
2ir 

xe*'^-*^( 9l - g ')f(?2-9'). 



(69) 



where we have introduced the squeezed-vacuum wave 
function ^(q), defined as ip{q) = (Q;q\S(r')\Q). 

We can compute the marginal POVM elements F(p') 
by integrating E(\) over dq' , 



Hp') 



dq'E( X ) = J dq'EiR- 1 ^') 

IQ';<7i)<Q';?2| 



dqidq 2 . , 



2tt 



X e*'(«-») j dq'i>( qi - q'W{q2 - q')- (70) 
Performing the dq' integral yields 



dq'^{qi - q'W(q2 - q') = exp 



where we have used that 1^2 



r'/2 / p 2r' 



„2r' 



"(?1 - 92) 2 



^(«) = ^T74 ex P ( —1 



(71) 
(72) 



With this, the marginal POVM reads 

f(jp') = J^m^m* 

x exp 



2r' 



ip'iii - 92) — 7- (qi - 92) 



(73) 
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which in the limit 



-oo converges to 



F(p') = \P';p'){P';p'\. 



(74) 



By rotating the canonical operators, Q' and P' we have 
diagonalized the covariance matrix of <jq, and got rid 
of the correlations between Q and P. We thus put all 
the noise introduced by the auxiliary state into the Q' 
quadrature, and put all the information of the signal 
state into the P' quadrature. This enables us to use the 
marginal POVMs without loss of information, which in 
turn means that it suffices to measure the P' quadrature, 
i.e., the informative one. 

Another interpretation follows from pflj . where it is 
shown that introducing a squeezed state in the auxil- 
iary port is equivalent to leaving the vacuum and tun- 
ing the beamsplitter to a transmittance/reflectance other 
than 50%/50%. The limit of infinite squeezing amounts 
to having a completely transmitting or reflecting mirror, 
which again converts heterodyning into homodyning. 



VIII. ESTIMATION SCHEME 

The analysis carried out so far tells us that opti- 
mal phase measurements can be performed by means 
of homodyne tomography of the appropriate quadrature 
9' = § ± $(s)/2, where $(s) is given by Eq. (J^ and 9 
is the estimate of the first step measurement, which is 
not assumed to be optimal. Let us call this quadra- 
ture P'. The outcomes will be gaussian distributed with 
zero mean (recall that our signal state is a squeezed vac- 
uum) and variance given by 



2 _ l + s 4 + (l-s 4 )cos2(6»' 



As 1 



(75) 



as can be seen by diagonalizing the M matrix in Eq. i|52|) 
through the transformation x' — R{9')x- 

It is straightforward to check that the Fisher informa- 
tion provided by this distribution is exactly that given 
by Eq. (|53|. 

The maximum likelihood analysis of the data yields the 
condition a 2 (9', 9 M le) = J2Pi/N, where {pi,_. . . ,p$} is 
the set of outcomes corresponding to the N measure- 
ments of the second step. Thus 



'mle = arccos 



4a 2 £p?/jV-l-s 4 



(76) 



and the MLE is twice degenerate. There is, however a 
trivial way to break this degeneracy by using the out- 
comes of the first step. The prescription is to choose 
the solution closest to the rough guess 9$. This estima- 
tor has an asymptotically vanishing bias, whereas that 
corresponding to the other solution has a constant bias 
which goes roughly as 



E[9] — 9 ~ arccos 



is a 



,0)-l 



(77) 



Therefore, as the prior estimation gets accurate with in- 
creasing N, the separation of the two maxima of the like- 
lihood function remain constant. By choosing the maxi- 
mum closest to #o> one nas an exponentially small error 
probability. 

In summary, the optimal scheme goes as follows: 

1. Perform any phase measurement (not necessarily 
optimal) on N a (1/2 < a < 1) copies of the sig- 
nal state. From the outcomes, compute a pre- 
liminary estimation 9q, which has a typical error 
59 2 ~l/N a . 

2. Measure the quadrature with 9' = 9 + $(s)/2, 
where $(s) is given in Eq. (|62|l . on the remaining 
N = N — N a copies. This corresponds to setting 
the phase reference (the Local Oscillator) to 9'. 

3. The Maximum Likelihood Estimator is obtained by 
choosing the minus sign in Eq. (|76(l . 

Note that one could equivalently write 9' = 9q — <f>(s)/2 
and choose the plus sign in Eq. (|76[) . The variance of 
this estimator goes as 



Var[0 M LE] ~ 4 



2s 4 



1 



(1-s 4 ) 2 8((n) 2 + (™»' 



(78) 



and its bias vanishes asymptotically, therefore its MSE 
goes as 



MSE[(9 ; 



1 



MLEJ 



(79) 



which is the Heisenberg limit presented in Sec. [n] 

IX. CONCLUSIONS 

We have seen that the phase-number Heisenberg in- 
equality is valid regardless of the existence of any phase 
operator, provided (A9) 2 is regarded as a phase estima- 
tion variance. It is just a consequence of the Cramer- Rao 
and Braunstein-Caves inequalities. 

We have seen that asymptotically the Heisenberg limit 
(optimal estimation) can be attained by means of a 
(three-outcome) POVM adaptive measurement. The 
physical implementation of such a measurement is a de- 
manding open problem. We have introduced an approx- 
imate MLE that provides an outstanding simplification 
of the maximum likelihood analysis. Asymptotically this 
approximate MLE and the exact one are shown to be 
equivalent. 

We have shown that the Heisenberg limit can be also 
attained asymptotically by means of dyne measurements 
and, surprisingly, only one quadrature needs to be mea- 
sured. This provides a remarkable economy of resources 
as compared to POVM measurements such as an eight- 
port homodyne detection with squeezing in the auxiliary 
port. The relative phase between the Local Oscillator in 
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homodyne detection and the signal state has been com- 
puted and used to devise a two-step phase estimation 
scheme which attains the Heisenberg limit asymptoti- 
cally. 

Remarkably enough, optimality is achieved with local 
measurements. Hence, collective measurements prove of 
little use in the large sample limit. This is a consequence 
of quantum estimation theory applied to one-parameter 
problems. 

Most of the schemes available so far succeed in attain- 
ing the behavior Var[#] ~ (n)~ 2 for (n) large enough. 
The exact behavior is usually not considered in the lit- 
erature. This is, in our opinion, an important loophole 
in phase estimation, since one is usually interested in 
low energy states, where (n) is small. An exception is 
the paper by Yurke et al. |33j |. in which they introduce 
a scheme whose variance comes close to our bounds for 
large (n) |34|. However, our scheme is asymptotically 
optimal and beats, to the extent of our knowledge, any 
proposal presented up to this day. 
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APPENDIX A: AVERAGING OVER THE 
NUMBER OF INFORMATIVE OUTCOMES 7Vi nf 

In this appendix we compute the average of 
Ee,i\r inf [(#mle — 9) 2 } over the number of informative out- 
comes Ninf, i.e., 



e[(0mle - ef] = Y, E Wil , [(^mle - ef] ( Ni c ) P N ^(i - pf-"** 

JVinf=0 

N 

I / \ \ . 

(Al) 



The first sum runs from 1 to N, thus taking into account 
that the actual variance for the rare case Ni n { = is 
considered separately. We can compute the expectation 
value (l/N- m {) by expanding 1/Y in f around the expecta- 
tion value of iVj n f, which is (N in {) — pN, and keeping 
terms up to second order. We obtain 



JV 

E 



1 



N 



N ini \N inf 



p N ™(l-p) 



N-N int 



±( l + lZP-{l-pf 

Np\ Np. ' 



(A2) 



Inserting the value for p = 1 + 0(S9 2 ) yields 



Ef 



7MLE 



A(An) 2 N 



0(S9 2 



(A3) 
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